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WHEN DOES THE SCREENING EFFECT HOLD? 



o 

CN ■ By Michael L. Stein^ 

C^ I University of Chicago 

When using optimal linear prediction to interpolate point obser- 
(-y-j ' vations of a mean square continuous stationary spatial process, one 

often finds that the interpolant mostly depends on those observa- 
tions located nearest to the predictand. This phenomenon is called 
pH ' the screening effect. However, there are situations in which a screen- 

^0 , ing effect does not hold in a reasonable asymptotic sense, and theo- 

retical support for the screening effect is limited to some rather spe- 
cialized settings for the observation locations. This paper explores 
conditions on the observation locations and the process model un- 
der which an asymptotic screening effect holds. A series of examples 
shows the difficulty in formulating a general result, especially for 
processes with different degrees of smoothness in different directions, 
which can naturally occur for spatial-temporal processes. These ex- 
amples lead to a general conjecture and two special cases of this 
f^ ■ conjecture are proven. The key condition on the process is that its 

QQ ' spectral density should change slowly at high frequencies. Models not 

satisfying this condition of slow high-frequency change should be used 
^_ with caution. 

^N I 1. Introduction. The screening effect is tlie geostatistical term for the 

phenomenon of nearby observations tending to reduce the influence of more 
distant observations when using kriging (optimal linear prediction) for spa- 

S>^ I tial interpolation [Journel and Huijbregts (1978), Chiles and Delfiner (1999)]. 

H ' This phenomenon is often invoked as a justification for ignoring more distant 

- - -' observations when using kriging [Memarsadeghi and Mount (2007), Emery 

(2009)]. Only in some very limited special cases is the effect exact in the 
sense that the more distant observations make no contribution to the krig- 



Received November 2010; revised April 20II. 
^Supported by US Department of Energy Grant DE-SC0002557. 
AMS 2000 subject classifications. Primary 60G25; secondary 62M30, 62Mf 5. 
Key words and phrases. Space-time process, spectral analysis, kriging, fixed-domain 
asymptotics. 



This is an electronic reprint of the original article published by the 
Institute of Mathematical Statistics in The Annals of Statistics, 
2011, Vol. 39, No. 6, 2795-2819. This reprint differs from the original in 
pagination and typographic detail. 

i 



2 M. L. STEIN 

ing predictor, so it is natural to use asymptotics as a way to study the 
screening effect. 

Let us set some notation. Write x ■ y for the inner product of commensurate 
vectors x and y. Suppose Z is a mean square continuous, stationary, mean 
Gaussian process on M'^ with autocovariance function K{x) = E{Z{x)Z{0)} 
and spectral density /, so that K{x) = f^a e^^'^ f{oj) doj. When the mean is 
assumed known to be 0, kriging is often called simple kriging. Throughout 
this work, we assume that the problem of interest is to predict Z{Q). For 
S C M"^, write Z{S) for the vector of observations (in some order) of Z 
on S, and define e{S) to be the error of the best linear predictor, or BLP, 
of Z{Q) based on Z{S). Let Nf, and F^ be two classes of sets indexed by 
the parameter e > 0, with Nf, representing observations near and F^ more 
distant observations. We will say that Nf, asymptotically screens out the 
effect of Fe if 

(1, „^E.WUF.)^^ 

Stein (2002) argues that a useful asymptotic approach is to let the smallest 
distance from the observations to the predictand tend to as e J, 0. Specifi- 
cally, Stein (2002) proves (1) when, essentially, for some xq E W^ not in the 
integer lattice, F^ is all points of the form e(xo + J) for J in the integer 
lattice, A'g is the restriction of -Fe to some fixed region with in its interior 
and / is regularly varying at infinity [Bingham, Goldie and Teugels (1987)] 
in every direction with a common index of variation. The methods used in 
Stein (2002) make strong use of the gridded nature of the observations and 
are not applicable here. Furthermore, requiring / to be regularly varying at 
infinity with common index of variation in all directions excludes models for 
spatial-temporal phenomena that exhibit a different degree of smoothness 
in space than in time. Section 4 provides further discussion of these issues. 
Ramm (2005), Chapter 5, takes a different approach to studying an asymp- 
totic screening effect by considering a process observed with white noise 
everywhere in some domain and letting the variance of the white noise tend 
to 0. In this work, we take a closer look at how the set where Z is observed 
affects whether an asymptotic screening effect holds. 

We will take the sets N^ and F^ to have a particular form that simplifies 
the asymptotic analysis. Suppose xi,...,a;„ are distinct nonzero elements 
of W^, 2/1, ... , ym are distinct elements of W^ and yo £ I^'^ is nonzero. For the 
rest of this work, let N^ = {exi ,..., exn} and F^ = {yo + eyi, . . . , yo + eUm}- 
Section 2 explores when (1) holds through a series of examples leading to 
a broad conjecture under a key assumption on the spectral density / of the 
random field: for every R<oo, 



(2) lim sup 



fi^) 



1 



0. 
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The examples will demonstrate that one generally needs a further condi- 
tion on iVe depending on the mean square differentiability properties of the 
process. For nondifferentiable processes, no further assumptions on A^^ may 
be needed. Indeed, for nondifferentiable processes on M, Theorem 1 in Sec- 
tion 3 has (1) as its conclusion under (2) and a mild additional condition 
on /. For nondifferentiable processes on M?, if one restricts the cardinality 
of N^ to 1 and of -F^ to 2 (and sets 2/2 = 0), then Theorem 2 proves (1) 
under (2) without any additional conditions on /. 

Matern models [Stein (1999a)] appear in both the examples and the proof 
of Theorem 1. Define ICi, to be the modified Bessel function of the second 
kind of order u [Olver et al. (2010)]. The Matern model on M has autoco- 
variance function (/)(a|x|)'^/C,y(a|2;|) for positive <j),a and v. The parameter z/ 
controls the smoothness of the process: Z has m mean square derivatives in 
any direction if and only if i^ > m. The corresponding spectral density equals 
0(a^ + \uj\'^)~'^~ ''^ times a constant depending on a,z^ and d. All Matern 
models satisfy (2). 

2. Examples. This section studies a number of examples to gain some 
insight into the conditions on / and A'g that are needed in order for (1) 
to hold. The derivations of these results are elementary but not necessarily 
easy. Rather than give detailed derivations of all of them, I will outline 
derivations in a few of the more difficult examples in Section 5.1. 

To see why a condition like (2) is needed, let us first consider an exam- 
ple on M addressed in Stein and Handcock (1989) and Stein (1999a), pages 
67-69. Suppose n = l, xi = l, m = 2, yQ = l, yi = and 2/2 = 1; see Figure 1. 
Consider K{x) = e~'^', a Matern model with smoothness parameter 2- The 
corresponding process is mean square continuous but is not mean square 
differentiable, and it is easy to show Ee{N^)'^ ~ 2e as e 1 0. This process is 
Markov, so that Ee{NeUFe)'^ = Ee{Ns)^ for all e < 1 and (1) holds trivially. 
Next consider K{x) = (1 — Ixl)"*" (where the superscipt + indicates positive 
part), for which f{uj) = ^^°^^ , which does not satisfy (2). Stein and Hand- 
cock (1989), page 180, give the BLP based on ^(A'jUFg), from which it 
is not difficult to show that Ee{Nf,)'^ ~ 2e, just like for K{x) = e"'"^', but 
Ee{Ne U Fef ~ |e as e i so that Ee{Ne U Fef/Ee{Nef ^ | as e i 0. The 
choice of yo = 1 is critical here: for yo 7^ 1 but positive (keeping xi = l,yi = 
0,^2 = 1), Ee{Ne U Fef/Ee{Nef -^ I as e ^ 0. The anomaly for yo = 1 is 
related to the lack of differentiability of K[x) at x = 1, which is in turn re- 

+ • 00 

I \ 1 1 

£ 1 1 +e 

Fig. 1. Prediction problem for triangular autocovariance function. Prediction site 
(+ sign), nearby observation (solid circle) and distant observations (open circles). 
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Fig. 2. PredicUon problem for autocovanance function K{s,t) — e^'"''*' . Symbols as in 
Figure 1. 

lated to the oscillations at high frequencies in /. See Stein (2005) for further 
discussion on the relationship of the differentiability of K away from the 
origin and the high-frequency behavior of /. 

Proposition 1 in Stein (2005) provides a second example showing why 
a condition like (2) is needed to have a screening effect. The following 
special case of this result suffices to illustrate the point. Suppose Z is 
a stationary process on M^ with autocovariance function K{s,t) = e~''^''''' 
for s,t G M. The corresponding spectral density f{uJi,u)2) is proportional 
*° TTT — mrir^i which does not satisfy (2). Consider the situation pic- 
tured in Figure 2, for which xi = (0,1), yo = (l^O), yi = (0,1) and y2 = 
(0,0). Then using either direct calculation or Proposition 1 in Stein (2005), 
lim,^oEeiN, U Fe)yEe{N,)^ = 1 - e'^. 

The remaining examples all consider / satisfying (2). To see why an ad- 
ditional condition on N/, is needed for (1) to hold for differentiable pro- 
cesses, consider a Matern model with smoothness parameter | : K{x) = 
e~'^'(l + |x|), for which the corresponding process is exactly once mean 
square differentiable. For N,. = {e}, F^ = {1} (top plot in Figure 3), straight- 
forward calculations yield Ee{NsY ~ ^^ ^'^'^ Ee{Ne U F^)^ ~ f^^^ as e 1 

so Ee{Ns U Fef / Ee{Ni;)'^ — > ^^ as e J, 0. Unlike the triangular case, there 
is nothing special about yo = l here and the more general result for yo > is 
Ee{Ne U Fef/EeiNef -^ 1 - yl/{e^y° - 1 - 2yo - yo)- The reason the limit 
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Fig. 3. Prediction problems for Matern model with v ~ ^ on R. Symbols as in Figure 1. 
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is less than 1 is not because there is anything unusual about /, but rather 
that N^ is inadequate. Specifically, since Z{0) = Z(e) — eZ'{0) + Op(e) and 
cov{Z{e),Z'{0)} — 7- as e 4, 0, it is apparent that having even a somewhat 
informative predictor for Z'{0) would provide useful information about Z{0) 

not contained in Z{£). In fact, as £^,0, it is possible to show that Z'{0) = 
-^TziiZ{e) — -^^Z(\) is an asymptotically optimal predictor of Z'{^) based 

on (Z(e),Z(l)) and, in turn, that Z{e) — eZ'(O) is an asymptotically opti- 
mal predictor of Z(^) based on (Z(e),Z(l)). A screening effect does hold 
if 2e is added to N^ (bottom plot of Figure 3). Then it is possible to show 
that Ee{Ns U F^)^ ~ Ee{N^Y ~ |e^ as e 4, 0, so (1) is true. Furthermore, 
as e I 0, 2Z(2e) - Z{e) = Z{e) - e[{Z{2e) - Z{e)]/e\ is an asymptotically 
optimal predictor of Z(0) based on Z{Ne U F^) and {Z{2£) - Z{e)}/£ is 
a consistent predictor of Z'[{)). A reasonable conjecture for a process on M 
with exactly p mean square derivatives whose spectral density satisfies (2) 
is that any distinct xi, . . . , x^ with n> p suffices to make (1) true. 

It is helpful to consider this problem in the spectral domain. We need 
some further notation to proceed. For nonnegative-valued functions a and b 
defined on a common domain D, write o(x) <^ b{x) if there exists finite C 
such that o(x) < Cb{x) for all x G L* and, for x G M, a(x) ^ b{x) as x | xq 
if, for some c > 0, a(x) ^ b{x) for D = (xo,xo + c). Write a(x) x h{x) if 
a(x) <C h{x) and b{x) <^ a{x) and define a(x) x 6(x) as x 4, if a(x) ^ 
6(x) as X I xo and 6(x) ^ a(x) as x^xq. For a complex- valued function g 
and a nonnegative function / defined on a domain D (always W^ here), 

define \\g\\f = \ J j^\g{x)\'^ f {x) dx . To each random variable of the form 

X]?=i ^j^i'^j) there is a corresponding function J21=i Aje*'^'*^ , and the map- 
ping is an isometric isomorphism in the sense that E{'^"'^^XjZ{sj)}'^ = 
kd IE"=i Aje*^-^^f /(^) duj. Write J2]=i (kjeZ{£Xj) for the BLP of Z(0) based 
on Z{N^) and (pei^) = X^,- 4>jeG^'^^'^^ for the corresponding function. If we set 
r?e(w) = 1 - M^), then Fe(A^^)2 = \\i],\\j. 

For any ^ C M'', caU f^\r]e{uj)\'^f{io)duj/\\r]e\\j the fraction of Ee{Ne)'^ 
attributable to the set of frequencies A. Write b{r) for the ball of radius r 
centered at the origin. For the scenario in Figure 3(a), for any fixed ujq > 0, 
as £^,0, 

,,, kuo)\VeH\^fiu^)du: 2/ _i ^0 K n 

(3) — ^ — - — - — t;5 ~— <tan ujQ o^>0 

so that an asymptotically nonnegligible fraction of Ee{Nir)'^ is attributable 
to a fixed range of frequencies. Similar to the definition of %, let tp^ be 
the function corresponding to e{N£ U F^), so that Ee{Ni; U F^)^ = HV'sHf • 
Then (3) allows Z[l) to improve the prediction nonnegligibly by making 
|^£(a;)P/|?7e(a;)P substantially smaller than 1 in a neighborhood of the ori- 
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gin. In contrast, for the scenario in Figure 3(b), /^^ -, |%(ti;)p/(a;) du <^ e^ 
as e I for any fixed tJo, so that /^, Arii,(^bj)\^ j (ui) duj <^ e\r\^^, as e J, 
0. In this case, making |^e(a;)p/|?7e(a;)p substantially smaller than 1 in 
a neighborhood of the origin cannot yield a nonnegligible asymptotic im- 
pact on the mean squared prediction error. Thus, Jw^ y |'(/'e(w)P/(w) da;/ 
Ki^ Nc |??£(w)p/(a;) do; must be bounded by some constant less than 1 as 

e 4- for all wq for (1) not to hold. The fact that / is well behaved at high 
frequencies [i.e., satisfies (2)] effectively precludes this possibility so that (1) 
holds. This line of reasoning forms the basis of the proof of Theorem 1; see 
Section 5.2. 

It is interesting to reconsider the two cases pictured in Figure 3, for a pro- 
cess that is not quite mean square differentiable: K{x) = \x\}Ci{\x\), a Matern 
model with smoothness parameter 1, for which K{x) = 1 + 23^^1og(2l^l) + 
^(27 — l)x^ + 0(x^log|x|) as X — >■ with 7 being Euler's constant. The corre- 
sponding spectral density / is proportional to {l + uP')"'^''^ . Since the process 
has no mean square derivatives, I conjecture that (1) should hold for any 
nonempty N^. For the scenario in Figure 3(a), Ee{Ng)'^ ~ — e^loge and, for 
fixed uJo> 0, 

I r M2rr .. f \l-e'''^\^ + {K{e)-l}\ 2 

\7]s{u;)\'f{uj)duj- / J ,J , \.^' '—dcv^e' 



/feM) Jb{ujo) (l+a;2)3/2 

as e^O. Thus, the fraction of the Ee{N(,)'^ attributable to ^(wo) tends to 
as e 4 0, although at only a logarithmic rate. Not coincidentally, direct cal- 
culation shows that for F^ = {!}, (1) holds and I would expect it to hold for 
more general F^. In fact, Theorem 2 in Section 3 applies in this case and it 
follows that (1) holds when F^ has two points (and 2/2 = 0). 

Next consider some settings for the Matern model with z^ = | on M^ . Fig- 
ure 4(a) shows a situation in which there are two nearby observations in the 
vertical direction from the origin and two distant observations in the hor- 
izontal direction. One might imagine that because the nearby observations 
provide no information about how the process varies in the horizontal di- 
rection, the distant observations might provide nonneglible new information 
about Z{Q). However, Section 5.1 demonstrates that (1) does hold in this 
case. The next two examples are related to the one-dimensional examples 
considered in Figure 3 for a Matern model with v =^. Write Zij for the ijth 
partial derivative of Z. In Figure 4(b), N^ has three observations, but they 
are collinear along a line that does not go through the origin and it is pos- 
sible to show that the BLP of Zi^o(0,0) based on Z{Ng) has asymptotically 
negligible correlation with Zi^o(0,0) as e 4,0. As a consequence, the asymp- 
totic results are identical to what we had in Figure 3(a): Ee{Nir)'^ ~ e^ and 
Ee{N^ U Fe)^ ~ ^Ei^^ &s e I 0. \i N^ has three points arranged as in Fig- 
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Fig. 4. Prediction problems for isotropic Matern model on R^ with v - 
Figure 1. 



Symbols as in 



ure 4(c), then {Z{e,0) — ^Z{2e,e) — ^Z{2e, —e)}/e is a consistent predictor 
of Zi^o(0, 0) and (1) holds; see Section 5.1. 

Now consider a model satisfying (2) for which the process is not equally 
differentiable in all directions. Stein (2005) gives an example of such a model. 
Specifically, consider a space-time model on M^ x M with spectral density 
{(1 + jwip)^ + a;|}~^, {uji,u!2) G M^ X M. Writing erfc for the complementary 
error function, the corresponding autocovariance function K is Stein (2005) 

i.(x,t)4.^eNerfc(|.,V^ + J^)(l-N + f) 



(4) 



+ 16^ ^ '''^'^V^l 2|i|i/2 

+ ^7r \t\ exp(^ \t\ ^1^1^ 



, , 4i2 

l + \x\- -— 
\x\ 
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Fig. 5. Prediction problems for autocovanance function given in (4). Horizontal axes 
are first spatial coordinate and vertical axes are time. Symbols as in Figure 1. 



for 2: 7^ and t ^0. For a; = or t = 0, we can define K by continuity. For 
t = 0, we get K{x,0) = |7r^e~l^l(l + |x|), the Matern model with z/ = |, so 
the corresponding process is exactly once mean square differentiable in any 
spatial direction. Stein (2005) shows that K{0,t) = ivr^ - |7r3/2|t|3/2 _^(9(^2) 
as t — 7- so that K{0,t) is not twice differentiable in t at t = 0, and the 
corresponding process is not mean square differentiable in time. 

For (4), let us again consider the setting in Figure 4(c) with the horizontal 
axis corresponding to the first spatial coordinate and the vertical axis corre- 
sponding to time. It now turns out that the two points in N^ off of the hor- 
izontal axis contribute negligibly to the BLP whether or not -F^ is included. 
The problem is that the lack of differentiability of Z in the vertical direction 
implies that the BLP of Zi^q{0, 0) based on Z[Ni.) has asymptotic correlation 
with Zi^o(0,0). Consequently, the asymptotic results are the same as in 
Figure 3(a) for K{x) = e-\''\{l + \x\)] that is, Ee{NeiJFef /Ee{Nef -^ f^f 
as e 4,0 (Section 5.1). 

Figure 5 displays two other settings we now consider for K as in (4). 
In Figure 5(a), we have Ee{Ne VJFef/Ee{Nef -^ 1 as e i and, in Fig- 
ure 5(b), Ee{Ne U Fef/Ee{Nsf -^ f^ as e 4, 0. These two cases show that 

it is possible to have sets Nf, C N^ yet have that (1) holds for the pair of 
sets {N^,Fi:) but not {N^,F^), further complicating any search for a general 
result that applies to processes that are not equally smooth in all direc- 
tions. 

These examples demonstrate that any general theorem that encompasses 
all of them will need a condition on N^ that depends on /. The following 
conjecture is in accord with all of the examples presented here: 

Conjecture 1. Suppose / satisfies (2) and the following assumption: 
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Assumption A. for j = 1, . . . ,n, all mean square derivatives of Z at 
the origin in the direction Xj can be predicted based on Z{N^) with mean 
squared error tending to as e J, 0. 

Then for all r > 0, 

lim ^^%^^\' ' = 1. 

£4.0 Ee{N,y 

Note that here I have expanded the set of distant observations to include 
all locations more than r from the origin, which simplifies the statement 
of the result although undoubtedly complicates its proof (assuming it is 
true). It is somewhat unsatisfying to have the condition on N^ given in 
terms of properties of predictors of derivatives of Z rather than some purely 
geometric condition, but I see no way to accommodate the examples treated 
here for K as in (4) without a condition something like Assumption A. 
Verifying whether Assumption A holds in any particular setting may require 
a fair amount of work, although for N^ of fixed and finite cardinality as we 
consider here, it should generally be possible to make this determination. 
Note that if all mean square derivatives of Z at the origin can be consistently 
predicted based on Z{N^) as e ^ 0, then Assumption A holds for any Nf, = 
{esi,...,esi} with {xi,...,Xn} C {si,...,Si}. 

In all of the examples for which (1) holds, 

for all cjQ > 0, and I suspect that Assumption A is equivalent to (5). Exam- 
ining the proof of Theorem 1 in Section 5.2 [see (19)], one sees that (5) is 
essential to making the proof work. 

3. Theorems. I do not know how to prove Conjecture 1 in anything like 
its full generality. Assuming Z is not differentiable in any direction simplifies 
matters considerably, because Assumption A then holds for any nonemp- 
ty iV^. Theorem 1 considers nondifferentiable processes on M and Theorem 2 
nondifferentiable processes on M?. 

Theorem 1. Suppose, for d= 1 and some a G (0, 2), 
(6) /(a;)x(l + M)-"-\ 

and f satisfies (2). Then (1) holds. 

Condition (6) is stronger than necessary to guarantee Z is not differen- 
tiable. Because part of the proof is to show that the low frequencies do 
not matter in the limit, (6) can likely be weakened to hold only for all w 
sufficiently large. Removing (6) entirely would be more difficult. 
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The next theorem appHes to nondifferentiable processes in M? and does 
not require any conditions on / beyond (2). However, it does restrict Nf, to 
have only one point and -F^ to have two. The theorem also assumes y2 = 0, 
but this restriction does not meaningfully detract from the content of the 
result and, in any case, could be removed at the cost of a somewhat messier 
proof. Extending the result to M is straightforward, but taking d > 3 is 
pointless in this setting because any 4 points in M.'^ fall on a three-dimensional 
hyperplane, and even taking d = 3 provides no new insight beyond what is 
learned from the two-dimensional setting. 

Theorem 2. Suppose Z has spectral density f satisfying (2) and that Z 
is not mean square differentiahle in any direction. In addition, suppose Nf, = 
{exi} and F^ = {yQ,yQ + eyi}, where xi,yQ and yi are all nonzero. Then (1) 
holds. 

Note that the example referred to in Figure 2 satisfies the conditions on A^^ 
and F^ in Theorem 2, and the process is not mean square differentiahle in 
any direction, but / does not satisfy (2). As we have seen, (1) does not hold 
in this setting, so that Theorem 2 would be false if we removed (2). 

Throughout this work we assume that Z has a known mean 0. It is com- 
mon in practice to assume that Z has an unknown constant mean fi and 
then predict Z{0) by what is called the ordinary kriging predictor, which 
is just an example of the best linear unbiased predictor [Stein (1999a)]. In 
all of the examples considered in Section 2, for which (1) holds for simple 
kriging, it still holds for ordinary kriging. Furthermore, Theorems 1 and 2 
can be easily shown to hold for ordinary kriging by proving that, under the 
conditions of the theorems, the ordinary kriging predictor based on Ng, is 
asymptotically optimal relative to the simple kriging predictor (see the ends 
of each proof in Section 5). Thus, if Conjecture 1 holds for simple kriging, 
then I would expect it also holds for ordinary kriging. 

4. Discussion. The space-time process on M^ x M considered in Section 3 
with spectral density {(1 -|- jwip)^ -I-cjI}"^, (^1,^2) G M'^ x M, is an example 
of a process with a different degree of differentiability in time than in space. 
It is a special case of the stochastic fractional heat equations studied by 
Kelbert, Leonenko and Ruiz-Medina (2005), which are in turn a special case 
of a class of space-time processes suggested in Stein (2005) whose spectral 
densities are of the form 

(7) /(^i,^2) = {ciiaj + \uifr + C2{ai + \i^2?rr'' 

for cji G M'^i, UJ2 G M'^^ ^ > ^ + ^ a'^d ci, 02,01,02 and a\ + a^ positive 
to ensure / is integrable. Because of the superficial similarity of this model 
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to the Matern model, we might call it doubly Matern. All spectral densities 
of the form (7) satisfy (2) and thus, I conjecture, satisfy an asymptotic 
screening effect whenever Assumption A applies to N^. At the same time, 
by adjusting the parameters ai,a2 and v, we can obtain processes with 
any desired degree of differentiability in time and any separate degree of 
differentiability in space [Stein (2005)]. Note that / of the form (7) satisfies 

the conditions of Theorem 2 when di = do = 1, 2j/ < — H and 2iy < 

h — , the last two conditions being necessary and sufficient to make Z 

not mean square differentiable in any direction. Stein (2011) derives some 
results for the covariance structure when ai = 02 = and Q2 = 1 . 

Despite its flexibility, model (7) is still restrictive in some ways, in par- 
ticular in exhibiting what Gneiting (2002) calls full symmetry, due to the 
fact that f{(jji,uj2) = /(wi,— ^2), and hence the corresponding process has 
the same covariance structure with time running backwards as it does with 
time running forward. Thus, for example, this model is unsuitable for pro- 
cesses with a dominant direction of advection. Stein (2005) discusses possible 
approaches to extending this model to allow for asymmetries. 

As noted in Section 3, (5), which says that only an asymptotically negli- 
gible fraction of Ee{N£)'^ can be attributed to some fixed frequency range, 
is crucial to obtaining a screening effect. This same property was also the 
key idea in Stein (1999b) to obtaining explicit results on the asymptotic effi- 
ciency of predictors based on an incorrect spectral density having similar be- 
havior to the correct spectral density at high frequencies. The high-frequency 
behavior of a Gaussian process is also crucial to estimation of the covariance 
structure [Stein (1999a)], and misspecification of this high-frequency behav- 
ior can lead to poor behavior of estimates, particularly if likelihood-based 
methods are used [Stein (1999a), Chapter 6, and Stein (2008)]. As statisti- 
cians strive to advance the statistical analysis of spatial-temporal processes, 
they should pay close attention to the spectral behavior of the models they 
use. In particular, models that do not satisfy (2) should be used with caution. 

5. Proofs. 

5.1. Examples. For a random vector Y , write cov(y) for the covariance 
matrix of Y , write for a column vector of zeroes whose length is apparent 
from context and denote transposes by primes. The following result simplifies 
the calculations for several of the examples. 

Lemma 1 . // there exists a(e) > 0, 5e^ ]R"+™' and A^ an (n + m) x (n + m) 
matrix such that 

(8) limcov^^^)^^^^)-^^-^^^^^^^)^^^^^ 0' 

i»j nmcovl A,Z(iV,UF,) J [o K 
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for some k > and K positive definite, then 

E{Z{Q)-5e-Z{E)Y 



lim- 

£4,0 



Ee{N,VJF,Y 



1. 



For (8) to hold, e^ = Z(0) - 5e ■ Z{Ne U F^) must satisfy Ee{Ne U Fef / 
Eel ~^ 1 as e 4, 0. To prove the lemma, note that (8) and K positive definite 
imply cov{AeZ(e)} is positive definite for all e sufficiently small. Thus, for 
all e sufficiently small, the BLP of ^(0) based on Z{N^ U F^) is the same as 
the BLP of Z{0) based on A£Z{Ni.{JFir). Since matrix inverse is a continuous 
function in some neighborhood of K, using basic results on BLPs [e.g.. Stein 
(1999a), Section 1.2], 

a{efEe{NeUFef 

= a(e)2(var e, - covje^, Z(A^, U F,)'A^J 

X [coy{AeZ{N, U F,)}]-i cov{A,Z(iV, U F^), ej) 

= var{a(e)ee} - cov{a{e)ee,Z{Ns U FeYA'^} 

X [cov{A,Z(iV, U F,)}]'^ cov{A,Z(iV, U F^), a(e)ej 

^ A; - O'ET-^O 

as ej, 0, and the lemma follows. 

To apply Lemma 1 to the setting in Figure 4(a) with K{x) = 
it suffices to show 

/e-3/2{Z(0,0) - 2Z{0,e) + Z{0,2e)}\ 

Z{0,e) 

e~HZ(0,2e)-Z(0,e)} 

Z(1,0) 

e-^{Z{l + e,0)-Z{l,0)} 



e-|^l(l + |x|), 



lim cov 

eiO 



V 



/ 3 






1 







2e-i 



— e' 








1 











2e-i 





1 





Vo 


-e-i 








1 



To show, for example, that cov[e-H^(0,2e) - Z{0,e)},e-^{Z{1 + e,0) - 
Z(1,0)}] —7- 0, define the function K on [0,oo) by K{r) = e~^'{l + r), which 
has bounded derivatives of all orders on [0,oo). Then using a Taylor series, 

cov{Z(0, 2e) - Z(0, e), Z{1 + e, 0) - Z(l, 0)} 



A-(V(l + e)2 + 4e2) _ K{,/{l + ey + e^) 



K{Vl + 4e^) + K{Vl + ^) 
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K'il + e)W{l + e)2 + 4e2 _ ^(i + e)2 + ^2} 

- K'{l){Vl+4e^- \/l + e2} + O(e^) 

2e2 £2 



K'(l + e)(l + e: 



(1 + ^)2 2(l + e)2 



K'(l)(2e2-ie2J+o(,4) 



^e-" 



and cov[e~'^{Z{0,2e) - Z{0,e)},e~^{Z{l + e,0) - Z(1,0)}] ^0 follows. 

Lemma 1 can be applied to the setting in Figure 4(c) with K{x) 
e~'^'{l + |rE|) by showing 

fe-^/^Z{0, 0) - 2Z{e, 0) + ^Z{2e,e) + iZ(2e, -e)} N^ 

Zie,0) 

e'^{2Z{e, 0) - Z(2e, e) - Z{2e, -e)} 

Z{1,0) 

1{WV5-8V2) 

1 2e-i 



lim cov 



V 



/ 








2e- 



-2e- 



-2e- 
1 



Specifically, it is not necessary to consider Z{2e,e) and Z{2e, —e) separately: 
by symmetry, the BLP of Z(0,0) based on Z{N^ U F^) depends on Z(2e,e) 
and Z{2e, -e) only through Z(2e,e) + Z{2e, -e). 

As a final example, let us apply Lemma 1 to the setting in Figure 4(c) 
with K given by (4). Again by symmetry, we can restrict to predictors that 
depend on Z{2e,e) and Z{2e,—e) only through Z{2e,e) + Z(2e, — e). For a 
and b fixed and positive, using a Taylor series and 



erfc(x) 



1 



2 ^x-ix3)+0(|x| 



as x — )• [Olver et al. (2010), page 162], it is possible to show 

K{ae, be) = ivr^ - Kirbef/^ + 0{e'^) 

as e 4, 0. This result also holds when a or 6 equals 0. It follows that 

/ e-HZ{0,0)-Z{e,0)} 

lim cov Z{e,0) 

^4-0 I ^^3/i^2Z{e, 0) - Z(2e, e) - Z{2e, -e)} 



W 












1(2-^2)^3/2 
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so that Z{e, 0) is an asymptotically optimal predictor of Z{0, 0) based on N,.. 
Furthermore, for ci = 2/(e^ — 4) and C2 = — e/(e^ — 4), 

/ £-1(^(0,0) - (1 + cie)Z(e,0) - C2eZ{l,0)} ' 

Z(e,0) 

Z(1,0) 
V e-3/4{2Z(e, 0) - Z(2e, e) - Z(2e, -e)} 



lim cov 



/I 2' 







— 



V 













4e 
1 2 














\ 



V2)tt^/^J 



and the conditions of Lemma 1 are satisfied. 



5.2. Proof of Theorem 1. Theorem 3.1 in Xue and Xiao (2011) implies 

(9) \\iie\\) = Ee{N,f^e'' 

as e 1 0. Let us use (9) to show that Yll=i\'i''je\ is bounded in e as e J, 0. If we 
define M^ = max(l,^^^-|^ \4>j£\) and xq = 0, we can write r/e(c;j) in the form 
Mj ^"^Q //jee^'^'^^j for appropriate /ije's, where, by construction, \^je\ < 1 
for all j and e. Thus, if we can show Mg bounded, then X]?=il'^jel is also 
bounded. By (6), there exists < Ci < C2 < oo such that 

(10) 7 %r-T < /(^) < 7 ^\ ^1 

for all uj. Thus, making the change of variables v = eu) in the second step, 

2 



j-C 

Ee{Nef>CiM^ / 
J ~i 



j=0 



(11) 



CiMh 



2^a 



n 
3=0 



(l + l^l 



ie + \u\ 



-a~l 



duj 



-0-1 



dv 



>CiM^'{^e 



j=0 



-Q-1 



dv 



for all e < 1. Suppose Mg is unbounded. Then there exists a sequence {e(A:)} 
tending to such that M£(fc) — )• 00. Because the /ije's are bounded, there 
exists {fiQ, . . . ,fj,n) S M"+-'^ and a subsequence of {e(A;)}, call it {e{kg)}, along 
which ifioeiki),- ■ ■ , ^^ne{k^)) "^ (/^o, . . . , /in) as £ -)> OO. Since a > 0, by domi- 
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nated convergence, it follows that 

2 






j=0 



u-'^-^du 



j=0 



z^-"-Mi/>0 



as £— )-oo, which, together with (11), contradicts (9), so Mg and J21=i\'t'j£\ 
must be bounded as e 1 0. 

Now consider the behavior of % at low frequencies. Define Ps = 1/ Ylj (pje 

and fie{^) = I - PeT,]=i(l)jee'^'^'-'^ . By (6) and (9), j^ \rjs{uj)\^ dco ^.e"" and, 
writing Re for real part, |%(a;)p > {Re%((^)}^ = (1 — p^)^ + 0(e^) uniformly 
for (x! G [0, 1] . It follows that 

(12) (p,-l)2«e° 

as e 1 0. Using le*"" - 1| < |x| for ah x G M, for /3 G [0, 1] and a £ (0, 2), 



(13) / \U^)\^fi^)du:<^e\ 

Jb{e-^) Jo 1 + W°+^ 



^-' ^2 



dL.« £2-/3(2--) 



as e 1 0. Because X]"^^ 't'jeZ{exj) is the BLP of Z{Q), ||??e||^ < ||%|| p so that 

(14) --^"^ ^'''-'' 

+ [ {\U^r-Mu;r}f{co)du;. 

Jb(e-PY 

Straightforward algebra shows 

\U^)\^-\Ve{^)? 

(15) = {pI - 1)|(/>,(^)|2 - 2{p, - l)Re(/>,(^) 

= 2{pe - l)Ve(w)|' + 2(p. - l)[|<Ae(^)|' " Re</..(^)]. 

The boundedness of the 0je's in e implies ||(/)e(6(j)p — p^^p <^vain{\,e^ijj'^) 
and |Rei?i)e(a;) — p7"^l ^ min(l,e2Lj2), and it follows that 

\\(Pe{oj)\'^ - Re0e(tj)| < |pg - 1| +min(l,e2tj2) 
as £^,0, which, together with (12) and (15), yields 

|f?,(u;)|2-|,7,(a;)|2«e" + e"/2niin(l,eV) 
as e^O. Thus, 



Jbie-PY 



(16) 



lb{e-P) 



^-^e"+e2+a/2^2 .oo ga/2 

■ doJ + I — — r-r duj 



-0 OJ 



a+1 



a Ul 



a+1 



«e3a/2^^a(/3+l) 
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as e ],0. Combining this bound with (13) and (14) implies that for all /3 G 

[0,1], 



(17) 



b{e-^) 



|r?,(a;)|2/(^) du « £2-/3(2-.) ^ ^3a/2 ^ ^a(/3+l) 



as e^O. Note that the bound in (17) is o{£'^) as e ^ for all a G (0,2) and 
/3g(0,1). 

Let Ag = (Ale, • • • , Ame), and assume A^ / hereafter, as the case A,. = 
is trivial to handle. We next show the correlation of e^N^) and A^ • Z{F^) is 
asymptotically negligible. Defining Xe^uj) = J2T=i -^jeC*^'^^^, 

corr{e(7Ve),A.-^(i^.)} 



(18) 



/,(,-,) r?,(a;)e--'-^°Ae(a;)/((^) do; 



+ 



/b(,-,,)cr/e(w)e— ^oAe(o;)/(^)do. 



\\%\\fUe\\f 

Using the Cauchy-Schwarz inequality and (17), for j3 G (0, 1) 



(19) 



h < - — ^ — ^ > 



l%ll/ 



as e 4- 0, uniformly in A^. Next, define Rk = 2'Kk/yQ and /c^ = \_yQ£ '^/(27r)J . 
Then Rk, < e~^ and 



r^e{uj)e-''^y^Xe{oj)f{uj)doj 



(20) 



h{e-PY 

oo 

k=k, •'"fe 

oo 

< 2 Y. f{Rk 



>-fc+i 



iU^)e~'^y''\e{u:)f{u:)du 



-R*4 



77e(^)e-*"^°Ae(L^)da; 



^fe 



+ 2E / '*\ve{i^)U^)\\f{^)-f{Rk)\duj 
k=ke -^^fe 

For u G (i?fc, i?fc+i], by (2), there exist constants c^ — )■ as fc — )• oo such that 
(21) \f{uj)-f{Rk)\<Ckmm{f{Rk),f{u:)}, 
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SO 






/>oo 

(22) < 2 sup Cfe / \r]e{io)\e{'jj)\f{u})dio 

k>ke JRk^ 

< SUpCA;||r/e||/||Ae||/, 
k>ks 

the last step by the Cauchy-Schwarz inequality. Now consider I^ in (20). 
Defining Os{uj) = r]i;{u})Xi;{uj), we can write 0^ in the form Y^^=i Oj^e^^'^^K 
For Me = max(l,^"^-^|(/)je|), let M = lim sup^j^g -^e ' which we showed is 

finite. Then, setting Lg = YlT=i\^je\) it is easy to show that '}2'7=i rjel — 

(2M + 1)L£ for all e sufficiently small. Integrating by parts, 

/•Rk+i p-iRkVo 

/ 9,iuj)e-''^y<^ div = — {e,{Rk) - 9siRk+i)} 

jRk Wo 

1 fRk+l 

lyojR^ 

Defining z = maxj \zj\, we have | Oe (Rk ) - 0e {Rk+i) \<T.j\ ^je 1 1 1 - e*^''^^^ /^° | < 
27r(2M + l)zLee/yo and \9'^{uj)\ < (2M + l)zLse for ah e sufficiently smah, 
so that 

cRk+l 

:{uj)e~''^y'' du 

' Rk 



<^T{2M + l)zLee 
Vo 



(23) 

for all e sufficiently small. Setting /3 = 2; inequalities (10) and (23) imply 

oo 

h<2C,{2M + l)zL,Y.^^ 
(24) 



<2a~^C2(2M + 1)zlJ — ] 

\yoJ 



for all £ sufficiently small. Similarly to (9), it is possible to show L^e"'^ <^ 
WXeWf as e|0, so that by (9) and (24), 

(25) „ /' „ «e^-"/^ 

WVeWfWKWf 

as e 4, uniformly in A^. Since q < 2, this bound tends to uniformly in A^. 
Applying (22) and (25) to (20) yields I2 [defined in (18)] tending to as 
£^,0 uniformly in Ag, which together with (18) and (19), implies 

(26) limsup|corr{e(iVe),A£-Z(Fe)}| =0. 
£4-0 A, 
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To finish the proof, it suffices to prove e{Ni;) is asymptotically uncor- 
related with all linear combinations of Z{N^ U -F^). Specifically, defining 
^e = (6e, • • • jCne), if we Can show 

(27) lim sup \corr{e{N,),A, ■ Z{F,) - E, ■ Z{N,)}\=0, 

then the theorem follows since 

^y;,^y = 1 - sup corr{e(iV,), A, • Z(F,) - S, • Z{N,)}'. 

Because e^N,.) is the error of a BLP based on N;,, corr{e(A'^j), H^ • Z{Nir)} = 
for all Hg. Thus, (27) follows from (26) if 

(28) var{A, • Z(F,)} < var{A, • Z(F,) - E, ■ Z{Ne)} 

uniformly in Ag and E^. There is nothing to prove if Hg = 0, so assume H^ 7^ 
hereafter. Consider the Matern spectral density fa{^) = (l + f^^)"^""^"^''^, for 
which the corresponding autocovariance function is Kaix) =Ca|x|""/C„/2(|x|), 
where c^, = ir^/"^ /{2"/^-'^r{{a + l)/2)} [Stein (1999a), page 31]. I wih write 
the subscript a to indicate quantities such as variances calculated under K^ ■ 
Since, by (6), /(w) x fa{oj), (28) is equivalent to 

(29) var,{A, • Z(F,)} < var,{A, • Z(F,) - H, • Z{Ne)] 
uniformly in Ag and Hg, which is in turn equivalent to 

(30) limsup sup |corr«{A, • Z(F,),H, • Z{Ne)}\ < 1. 

£4,0 Ae ,=£ 

Define A.e= J2T=i^js, ^je = >^js-:^>^-e, Le= EJLilVI and A^ = (Ai^, . . . , 
Xme)- Using the series expansion for Ka [Stein (1999a), (15) page 32] and 
setting ba = 7r/{r(a + 1) sin(i7ra)} and Sa{e) = - I]jjc=i ^je>^ke\yj - Vkl", 

var„{A, • Z{F,)} - 6«e"5«(e) < e^Ll 

Now Sais) is nonnegative because YlT=i-^je = 0) aiid l^^l" is a valid vari- 
ogram for a G (0, 2) [Stein (1999a), page 37]. Furthermore, if Lg / 0, Sa{£)/Ll 
is trivially bounded from above. It is also uniformly bounded from below: if 
Sa{£)/Ll tends to a limit along any sequence of e values, then there is a fur- 
ther subsequence along which Ag/Lg converges to some A = (Ai, . . . , A^) 7^ 
and, along this subsequence, by dominated convergence, 



^^^□.(e) 



m 



2 



w|-"-Mcj>0. 



Li 

Thus, no subsequence of Sa{e)/Ll can have as its limit and 
(31) var„{A,-Z(F,)}xe"L2, 
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which holds even if L^ = 0. Again using the series expansion for Ka, 
\coVa{Z{yo + e2/i),Ae • Z{F^)}\ < e^^L^, so that corro{Z(yo + eyi),K ■ 
Z{Fe)]^^ uniformly in A^ / 0. Thus, 

(32) var„{A, • Z(F,)} ~ \Ik^{^) + var„{A, • Z{F,)] 

as e 4, 0, uniformly in A^. Results similar to (31) and (32) apply to H^ • Z(Ns). 
Next, define Ce = Y.]=i^js, Ije = ^je - ^Ce, Xe = Y.]=i\ije\ and Es = 
(fie, • • -line) and consider 

cov„{A,-Z(F,),S,-Z(Ar,)} 

(33) = X.eCeKaiVO + ^Vl " ^^i) + A.^ COVa{Z{yo + £yi),Ee • Z{Ne)} 

+ isCov^{Z{exi),A, ■ Z(F,)} + cov„{A, • Z(F,),H, • Z{Ns)}. 

Since K^ has a bounded second derivative outside of a neighborhood of the 
origin, it is straightforward to obtain the following bounds: 

\X.eieKa{yo + ^yi - £Xi) - X.sieKa{yo)\ < e\\.e\\ie\, 

\X.eCOVc.{Z{yo + eyi),Ee-Z{N,)}\'^e\\.e\X,, 

\teCOVa{Z{eXi),Ae ■ Z{F,)}\ <^£\ULe 

and 

|cov,{A, • Z{F,),E, • Z{N,)}\ < e^L.X, 

as e 4-0. Applying these bounds to (33) gives 

|cov„{A, • Z{F,),E, ■ Z{Ne)} - X.sieKa{yo)\ 
(34) 

< e|A.^| le^l + £\X.e\Xs + e\ie\Le + e^LeXe. 

Now, from (32), 

|A.ee.i^a(yo)| 



(35) 



Vvar«{Ae • Z(F,)}var„{H, • Z(iV,)} 

|A.,e.^a(yo)l 



< 



A2,Ka(0) + var,{A, • Z(F,)}\/ei^a(0) + var„{H, • Z{Ne)} 
Ka{yo) 



which is in (0, 1) for all yo ^ 0. And, since a <2, 

e\X.e\\ie\ + e\X.e\Xe + e\ie\Le + e^4^£ 



0, 



A2, + e-L2./e + e°^| 
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which, together with (32), (34) and (35), proves (30) and hence (27) and the 
theorem. 

To prove that Theorem 1 also apphes to ordinary kriging, note that by 
setting /3 = ^, (13) and (16) together with (9) imply \\fis\\'i ~ ||%||f as e|0. 
Since fje corresponds to the error of a linear unbiased predictor under the 
constant mean model, we have that the mean squared error of the ordinary 
kriging predictor based on Z{N^) is at least ||t7£||^ and at most ||7?e||p so 
that if (1) holds for the simple kriging predictor it also holds for the ordinary 
kriging predictor. 

5.3. Proof of Theorem 2. Restricting A'g to one point and F^ to 2 al- 
lows us to make use of Lemma 1 to prove (1). Setting y2 = simplifies 
the calculations without changing any essential details. Specifically, defining 
Vix) = Jf (0) - K{x), we win show that 

(36) 

'Z(0)-Z(exi) z{yo)-Z{y^ + eyi] 

Z{exi},Z{yo + eyi), 



has limiting covariance matrix of the form given in (8), from which Theo- 
rem 2 readily follows. 

Let us consider the easier parts of the proof first. Independent of e, the 
variances of the elements of W^ are 1,K{0),K{0) and 1, respectively. Since Z 
has a spectral density, K is continuous and |-f^(y)| < K{0) for all y / 0. 
Thus, cov(VFe2, Wes) — )• K{yo) as e J, 0, and the 2x2 matrix with K{0) on 
the diagonals and K{yo) elsewhere is positive definite. Thus, it suffices to 
show that the other offdiagonal elements of the covariance matrix of W^ tend 
to as e i 0. First, cov{Wei,Ws2) = ^^/V{exi)/K{0) ^ as e i 0. Similarly, 
cov{Ws3, Wei) ^ as e i 0. 

Now consider cav iW fri,Wfrz)- We have 

cov{Z(0)-Z(exi),Z(yo + £yi)}= / e~'^-^y°+'y'\l-e''^--')f{u)du, 
so that for D{T) = {uj ■.\uj ■ xi\ < T}, 

COY{WeuWeZ?< 



.^ {4,|l-e---M/(^)M' 



i^(0)/ig2|l-e*^'^-^-i|2/(w)'^^ 

2{/^(^Jl-e---i|/(..)(ic^} 
(37) < ^ ' 



2 



^(0)/D(T)|l-e'"""M'/(^)c?W 

+ K(0)/^(^)Jl-e— i|2/(a;)dw 



2 
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for all T sufficiently large (to guarantee /^(-p-, |1 — e'^^'^'^^\'^ f {uj) duj > 0). Be- 
cause e"^ 1 1 — e*^"^'^^ I < |w • xi I and e~^ \ 1 — e*^"^'^^ | — )• |a; • xi | as e J, 0, by 
dominated convergence, 

By the Cauchy-Schwarz inequality, 

/^(P). 1 1 - e----^ V(^) dc^ L(T)c /(^) d^ 



(39) < 



/^(y).|l-e^---i|2/(a;)da; 
/(a;)(iw. 



ID{TY 

From (37)-(39), we will have cov(l^£i, VFes) — > as e | if the right-hand 
sides of (38) and (39) tend to as T — )• oo. The integrability of / implies 
Id(ty /('^) dio ^ as T ^ oo, so consider the right-hand side of (38). Let A 
be the 2x2 matrix with first row given by xi, orthogonal rows and deter- 
minant of 1 and set v = {vi,V2y = Auj. Define /(fi) = J^ f{A~^v) dv2- Up 
to a linear rescaling, / is the spectral density of the process Z along the xi 
direction, so it is integrable. In addition, because Z is not mean square differ- 
entiable in any direction, f^ v'lf{vi)dvi = oo. Then for any even function g, 

Id(t)9^^ ' ^i)/('^) d^ — "^ lo ^l^i)/!"*^!) dvi, so that for < S" < T, 

{/g(T)l^-^il/MM^ _ {foVif{vi)dv}^ 

/p(5.)|a;-xi|2/(a;)dw J^vlf{vi)dvi 

(40) 

_ {/o vif{vi)dvi + Jg vif{vi)dviy 



lo'^ifMdvi 
If we can show that 

(41) hm hm Uo^^f(-^)dv, + f^vJiv,)dv,r ^ 

then the right-hand side of (38) will tend to as T — )• oo. To prove (41), 
expand the square in the numerator and consider each term separately. First, 
by the Cauchy-Schwarz inequality, 

^.^^ {Jo vifjvi) dvi}^ ^ ^.^ J^ vlf{vi)dvi /(f f{vi) dvi ^ ^ 

^^°° Io^Ui^i)dvi ^^ J^vUivi)dvi 
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Again by the Cauchy-Schwarz inequality, 

{fjvj{v,)dv,r ^ f^vU{vi)dv,f^f{v^)dv, ^ r^ -^^^^ ^^^ 
Io'"Ui'"i)dvi ~ jQvlf{vi)dvi ~ Js 

which tends to when one takes Hm5_!.oo hni7^_j.oo since / is integrable. 
Finahy, 

/o vif{vi)dvijg vif{vi)dvi /o vifivi)dvijg vif{vi)dvi 
IovU{n)dvi Jgvlf{vi)dvi 



< 



/o vif{vi)dvi Jg vif{vi)dvi 



^ Is vif{vi)dvi 

r-si/2 _ -^ .s 

vif{vi)dvi + -^ 



1 f^'^' - I f' - 

<o/ vif{vi)dvi + - vif{vi)dvi 

^ Jo ^ Js^/^ 



I rS^'^ 



-~E^TT2 I f{vi)dvi+ I f{vi)dvi, 
S^'^ Jo Js^/^ 

which tends to as S — )• oo, and (41) follows. Thus, cov{W£i,We3) — )• as 
e 1 0. Similarly, cov{Ws2,Ws4,) — > as e J, 0. 

We will need the following lemma to handle cov(Wei, We4): 

Lemma 2. If Z is not mean square differentiahle in the direction x, then 

hm -— — - = 0. 

£4,0 V{ex) 

To prove the lemma, first note that the assumption on Z is equivalent to 
(42) / \oj-x\^f{ijj)doj = oo. 



If lim supgj^o yfexT ^ ^' ^^^"^ there must exist some sequence En i along 
which limn,_j.oo :^^ = C for some finite C, or 

f \l — g«£ni^-a;|2 

lim / 5 f{u))duj = C. 

But for any finite T, by dominated convergence, 

f M _ gie„ui-x\2 
C= lim / K f{(jj)duj 

r \1 — e*^"'^'^p 
> lim / f{oj)dijj 

"^^"^JlujlKT En 
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\uj ■ x\ f{uj) doj 

\^\<T 

for all r, which contradicts (42), and the lemma is proven. 
Consider 

cov{Z(0) - Z{exi), Z{y^) - Z{y^ + eyi)} 

Define fi{uj) = min(/(a;), 1), and write covi to indicate covariances calcu- 
lated under the spectral density /i. Then (2) and / integrable imply that 
/(w) = fi{uj) outside some bounded set, and it easily follows that 

cov{Z(0) - Z{exi), Zivo) - Z{yo + eyi)} 
(43) 

= covi{Z(0) - Z{exi),Z{yo) - Z{yo + eyi)} + 0{e^). 

Because Z is not mean square differentiable in any direction. Lemma 2 im- 
plies the O(e^) remainder in (43) makes no contribution to lim^^o cov(W£i,We4) 

We proceed by rotating coordinates so that one of the frequency axes 
points in the direction of y^. Specifically, let B be the 2x2 orthogonal matrix 
with determinant 1 and first row equal to yo and set t = (ri, r2)' = Boj. Then, 
defining iJe(ri,r2) = (1 - e'^(^"'^)-^i)(l - e-''^^~^^>y^), 

covi{Z(0) - Z{exi), Ziyo) - Z{yo + eyi)} 

,. oo ^27r(fc+l) 

= / E / e''-'H,in,T2)MB-^T)dndT2. 

Define the function g on M^ ^y^ foj. 2-Kk < n < 27r(A; + 1), g{B-'^T) = 1 - 
fi{B-^{2TTk,T2y)/fi{B-^T) if /i(5-V) > and otherwise. We have 

covi{Z(0) - Z(exi), Z(yo) - Z{yo + eyi)} 

m -X Ey.(«-' (t)) £'""'.-"^.(n.r.).n.. 

+ / e-'''-y°{l-e'''^-'''){l-e-''''-y')h{uj)g{uj)du. 

By (2), g{uj) — >• as w — )■ oo. Thus, given 5 > 0, we can find T < oo such that 
g{uj) < 6 for |a;| > T. Then 

e-^^-yo{l - e''''-''^){l-e-''''-y')fi{u)g{u)doj 
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<e' 



\ui\<T 

+ A5 



\uj ■ xi\\uj ■ yi\fi{uj)\g{uj)\duj 



\w\>T 

By the Cauchy-Schwarz inequality and /i < /, j\^\yj'\^ ~ e*^'^'^^||l — 
g-iewyij ^ f'^{^uj)duj < ^yV{£Xi)V{eyi), which, together with Lemma 2, im- 
phes 

(45) inn sup — ^ ^ < 4o. 

e;o VF(exi)T/(eyi) 

Since (^ is arbitrary, this linisup must in fact be 0. 

Now return to the the first term on the right-hand side of (44) . Integrating 
by parts, 

f2n{k+l) 

e~'^'H,{Ti,T2) dn = iH,{27r{k + 1), ra) - iH,{27rk, T2) 



2TTk 



f2n{k+l) Q 

I I e~'^^ ^^He{Ti,T2)dTi. 

h-Kk 



dTi 



There exists finite C independent of e and r such that 
d 



which implies 
(46) 



He{n,T2) 



<Ce{\l-e''^^ '^)-^i| + |l-e-^^(^ '^^'^^l, 



r-27r(fe+l) 



/ e-'^'H,{TuT2)dn 

J2iTk 



< 47rCe{|l - e''^^ '^>^i| + |1 - e''' 



ie{B ''-T)-yi 



|}- 



We can choose T finite so that if 2nk < n < 27r{k + 1), then fi{B~^{27rk, 
^2)') < 2/(i?~^r) whenever \t\ > T. Applying this result and (46) to the 
first term on the right-hand side of (44) and changing variables back to 
uj = B^^T, we get 



■ fc=— 00 



(47) 



<8^C7e / f{uj){\l-e''^-'''\ + \l-e"''^-y'\}du + 0{e'^) 



< SnCeiy^Viexi) + V^(eyi)}W / fiu;)du + 0{e^), 
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where the last step uses the Cauchy-Schwarz inequahty. Prom Lemma 2 
and (47), it follows that 

, \kEZ-oofiiB-\%'))£t'^e-^H,in,r,)dndr,\ 
lim sup . = 0. 

Together with (44) and (45), this limit implies 

^. covi{Z(0)-Z(exi),Z(yo)-^(yo + eyi)} ^ 
lim sup -^ = U, 

£4-0 ^V{exx)V{eyi) 

which together with (43) and Lemma 2, implies lim£j^ocov{Wei, W£4} = 0. 

Theorem 2 applies to ordinary kriging as well. Specifically, Z{ex\) is an 
asymptotically optimal linear predictor of ^(0) based on Z{N^ U F^) when 
the mean of Z is assumed to be 0, so since it is a linear unbiased predictor 
when the mean is an unknown constant, Z{ex\) must also be asymptotically 
optimal with respect to this more restricted class of predictors. 

Acknowledgment. The author thanks Steven Lalley for help with the 
proof of Theorem 2. 

REFERENCES 

Bingham, N. H., Goldie, C. M. and Teugels, J. L. (1987). Regular Variation. Ency- 
clopedia of Mathematics and Its Applications 27. Cambridge Univ. Press, Cambridge. 
MR0898871 

Chile;s, ,J.-P. and Delfiner, P. (1999). Geostatistics: Modeling Spatial Uncertainty. Wi- 
ley, New York. MR1679557 

Emery, X. (2009). The kriging update equations and their apphcation to the selection of 
neighbouring data. Comput. Geosci. 13 269-280. 

Gneiting, T. (2002). Nonseparable, stationary covariance functions for space-time data. 
J. Amer. Statist. Assoc. 97 590-600. MR1941475 

JOURNEL, A. G. and Huijdregts, C. J. (1978). Mining Geostatistics. Academic Press, 
New York. 

Kelbert, M. Y., Leonenko, N. N. and Ruiz-Medina, M. D. (2005). Fractional random 
fields associated with stochastic fractional heat equations. Adv. in Appl. Probab. 37 108- 
133. MR2135156 

Memarsadeghi, N. and Mount, D. (2007). Efficient implementation of an optimal in- 
terpolator for large spatial data sets. In Computational Science — IGCS 2007 (Y. Shi, 
G. VAN Albada, J. Dongarra and P. Sloot, eds.). Lecture Notes in Computer Sci- 
ence 4488 503-510. Springer, Berlin. 

Olver, F. W. J., LoziER, D. W., Boisvert, R. F. and Clark, C. W. (2010). NIST 
Handbook of Mathematical Functions. Cambridge Univ. Press, New York. 

Ramm, a. G. (2005). Random Fields Estimation. World Scientific, Singapore. 

Stein, M. L. (1999a). Interpolation of Spatial Data: Some Theory for Kriging. Springer, 
New York. MR1697409 

Stein, M. L. (1999b). Predicting random fields with increasing dense observations. Ann. 
Appl. Probab. 9 242-273. MR1682572 



26 M. L. STEIN 

Stein, M. L. (2002). The screening efltect in kriging. Ann. Statist. 30 298-323. MR1892665 
Stein, M. L. (2005). Space-time covariance functions. J. Amer. Statist. Assoc. 100 310- 

321. MR2156840 
Stein, M. L. (2008). A modeling approach for large spatial datasets. J. Korean Statist. 

Soc. 37 3-10. MR2420389 
Stein, M. L. (2011). On a class of space-time intrinsic random functions. Bernoulli. To 

appear. 
Stein, M. L. and Handcock, M. S. (1989). Some asymptotic properties of kriging when 

the covariance function is misspecified. Math. Geol. 21 171-190. MR0985969 
XuE, Y. and XiAO, Y. (2011). Fractal and smoothness properties of space-time Gaussian 

models. Frontiers Math. China. To appear. DOI:10.1007/sll464-011-0126-9. 

Department of Statistics 

University of Chicago 

Chicago, Illinois 60637 

USA 

E-MAIL: stein@galton. uchicago.edu 



